5 |
Parsing with Pretrained Language Models, Multiple Datasets, and Dataset Embeddings ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
On the Effectiveness of Dataset Embeddings in Mono-lingual,Multi-lingual and Zero-shot Conditions ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Genre as Weak Supervision for Cross-lingual Dependency Parsing ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
We Need to Talk About train-dev-test Splits ...
|
|
|
|
Abstract:
Anthology paper link: https://aclanthology.org/2021.emnlp-main.368/ Abstract: Standard train-dev-test splits used to benchmark multiple models against each other are ubiquitously used in Natural Language Processing (NLP). In this setup, the train data is used for training the model, the development set for evaluating different versions of the proposed model(s) during development, and the test set to confirm the answers to the main research question(s). However, the introduction of neural networks in NLP has led to a different use of these standard splits; the development set is now often used for model selection during the training procedure. Because of this, comparing multiple versions of the same model during development leads to overestimation on the development data. As an effect, people have started to compare an increasing amount of models on the test data, leading to faster overfitting and ``expiration'' of our test sets. We propose to use a tune-set when developing neural network methods, which can ...
|
|
Keyword:
Computational Linguistics; Machine Learning; Machine Learning and Data Mining; Natural Language Processing
|
|
URL: https://underline.io/lecture/37384-we-need-to-talk-about-train-dev-test-splits https://dx.doi.org/10.48448/x2hz-kd89
|
|
BASE
|
|
Hide details
|
|
9 |
Genre as Weak Supervision for Cross-lingual Dependency Parsing ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
DaN+: Danish Nested Named Entities and Lexical Normalization ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
From Masked-Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Lexical Normalization for Code-switched Data and its Effect on POS-tagging ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Fair Is Better than Sensational: Man Is to Doctor as Woman Is to Doctor
|
|
|
|
In: Computational Linguistics, Vol 46, Iss 2, Pp 487-497 (2020) (2020)
|
|
BASE
|
|
Show details
|
|
15 |
Bleaching Text: Abstract Features for Cross-lingual Gender Prediction ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|